Data Analysis with Python and PySpark

Data Analysis with Python and PySpark

  • Downloads:2611
  • Type:Epub+TxT+PDF+Mobi
  • Create Date:2022-03-26 06:51:45
  • Update Date:2025-09-06
  • Status:finish
  • Author:Jonathan Rioux
  • ISBN:1617297208
  • Environment:PC/Android/iPhone/iPad/Kindle

Summary

Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale。 This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Hadoop-based clusters to Excel worksheets。 You’ll learn how to break down big analysis tasks into manageable chunks and how to choose and use the best PySpark data abstraction for your unique needs。 By the time you’re done, you’ll be able to write and run incredibly fast PySpark programs that are scalable, efficient to operate, and easy to debug。

Download

Reviews

Alex Ott

good intro into PySpark, including even the ML pieces。 I've read the MEAP version。 good intro into PySpark, including even the ML pieces。 I've read the MEAP version。 。。。more